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Abstract 

The non-linear invariance principle of Mossel, O’Donnell and Oleszkiewicz establishes that if f{xi,x„) 
is a multilinear low-degree polynomial with low influences then the distribution of f{Bi,..., Bn) is close 
(in various senses) to the distribution of f{Gi, ■ ■ ■ ,Gn), where Bi en {—1,1} are independent Bernoulli 
random variables and Gi ~ N(0, 1) are independent standard Gaussians. The invariance principle has seen 
many application in theoretical computer science, including the Majority is Stablest conjecture, which 
shows that the Goemans-Williamson algorithm for MAX-GUT is optimal under the Unique Games 
Conjecture. 

More generally, MOO’s invariance principle works for any two vectors of hypercontractive random 
variables {Xi,..., Xn), (Ti,. • •, Tn) such that (i) Matching moments-. Xi and Xi have matching first and 
second moments, (ii) Independence-, the variables Xi ,..., Xn are independent, as are ^r, • ■ •, Tn- 

The independence condition is crucial to the proof of the theorem, yet in some cases we would like 
to use distributions (Xi,..., X„) in which the individual coordinates are not independent. A common 
example is the uniform distribution on the slice which consists of all vectors (xi,... ,x„) e {0,1}" 
with Hamming weight k. The slice shows up in theoretical computer science (hardness amplification, 
direct sum testing), extremal combinatorics (Erdos-Ko-Rado theorems) and coding theory (in the guise 
of the Johnson association scheme). 

Our main result is an invariance principle in which (Xi ,..., Xn) is the uniform distribution on a slice 
and {Vi,. ■ ■ ,yn) consists either of n independent Ber(p) random variables, or of n independent 
N(p,p(l —p)) random variables. As applications, we prove a version of Majority is Stablest for functions 
on the slice, a version of Bourgain’s tail theorem, a version of the Kindler-Safra structural theorem, and 
a stability version of the t-intersecting Erdds-Ko-Rado theorem, combining techniques of Wilson and 
Friedgut. 

Our proof relies on a combination of ideas from analysis and probability, algebra and combinatorics. 

In particular, we make essential use of recent work of the first author which describes an explicit Fourier 
basis for the slice. 


1 Introduction 


Analysis of Boolean functions is an area at the intersection of theoretical computer science, functional analysis 
and probability theory, which traditionally studies Boolean functions on the Boolean cube {0,1}”. A recent 
development in the area is the non-linear invariance principle of Mossel, O’Donnell and Oleszkiewicz [26) . a 
vast generalization of the fundamental Berry-Esseen theorem. The Berry-Esseen theorem is a quantitative 
version of the Central Limit Theorem, giving bounds on the speed of convergence of a sum Xi to the 
corresponding Gaussian distribution. Convergence occurs as long as none of the summands Xi is too “promi¬ 
nent” . The invariance principle is an analog of the Berry-Esseen theorem for low-degree polynomials. Given 
a low-degree polynomial f on n variables in which none of the variables is too prominent (technically, / has 
low influences), the invariance principle states that the distribution of f{Xi ,..., Xn) and f(Yi ,..., F„) is 
similar as long as each of the vectors (ATi,..., Xn) and (Yi,..., Y„) consists of independent coordinates, the 
distributions of Xi, Yi have matching first and second moments, and the variables Xi, Yi are hypercontractive. 

The invariance principle came up in the context of proving a conjecture. Majority is Stablest, claiming 
that the majority function is the most noise stable among functions which have low influences. It is often 
applied in the following setting: the Xi are skewed Bernoulli variables, and the Ij are the matching normal 
distributions. The invariance principle allows us to analyze a function on the Boolean cube (corresponding 
to the Xi) by analyzing its counterpart in Gaussian space (corresponding to the Yj), in which setting it can 
be analyzed using geometric methods. This approach has been used to prove many results in analysis of 
Boolean functions (see for example m)- 

The proof of the invariance principle relies on the product structure of the underlying probability spaces. 
The challenge of proving an invariance principle for non-product spaces seems far from trivial. Here we 
prove such an invariance principle for the distribution over Xi,..., Xn which is uniform over the slice 
dehned as: 

^ • • ■ ’ ^ ^0,1}” : cci H- \-Xn = k}. 

This setting arises naturally in hardness of approximation, see e.g. [6], and in extremal combinatorics (the 
Erdds-Ko-Rado theorem and its many extensions). 

Our invariance principle states that if / is a low-degree function on having low influences, then 
the distributions of f{Xi,..., Xn) and /(Yi,..., Yn) are close, where Xi,..., Xn is the uniform distribution 
on and Yi,... ,Yn are either independent Bernoulli variables with expectation k/n, or independent 

Gaussians with the same mean and variance. 

The classical invariance principle is stated only for low-influence functions. Indeed, high-influence func¬ 
tions like f{xi,... ,Xn) = xi behave very differently on the Boolean cube and on Gaussian space. For 
the same reason, the condition of low-influence is necessary when comparing functions on the slice and on 
Gaussian space. 

The invariance principle allows us to generalize two fundamental results to this setting: Majority is 
Stablest and Bourgain’s tail bound. Using Bourgain’s tail bound, we prove an analog of the Kindler-Safra 
theorem, which states that if a Boolean function is close to a function of constant degree, then it is close to 
a junta. 

As a corollary of our Kindler-Safra theorem, we prove a stability version of the t-intersecting Erdds-Ko- 
Rado theorem, combining the method of Eriedgut m with calculations of Wilson [32] ■ Eriedgut showed that 
a t-intersecting family in of almost maximal size (1 — e)(^~*) is close to an optimal family (a t-star) as 
long as A < k/n < l/{t -I- 1) — ^ (when k/n > l/{t + 1), t-stars are no longer optimal). We extend his result 
to the regime k/n as I/(t -(- 1). 

The classical invariance principle is stated for multilinear polynomials, implicitly relying on the fact that 
every function on {0,1}" can be represented (uniquely) as a multilinear polynomial, and that multilinear 
polynomials have the same mean and variance under any product distribution in which the individual factors 
have the same mean and variance. In particular, the classical invariance principle shows that the correct way 
to lift a low-degree, low-influence function from {0,1}" to Gaussian space is via its multilinear representation. 
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The analogue of the collection of low degree multilinear functions on the discrete cube is given by the 
collection of low degree multilinear polynomials annihilated by the operator X;"=i Dunkl [aiiQ] showed 
that every function on the slice has a unique representation as a multilinear polynomial annihilated by the 
operator We call a polynomial satisfying this condition a harmonic function. In a recent paper m. 

the first author showed that low-degree harmonic functions have similar mean and variance under both the 
uniform distribution on the slice and the corresponding Bernoulli and Gaussian product distributions. This 
is a necessary ingredient in our invariance principle. 

Our results also apply for function on the slice that are not written in their harmonic representation. 
Starting with an arbitrary multilinear polynomial /, there is a unique harmonic function / agreeing with / 
on a given slice. We show that as long as / depends on few coordinates, the two functions / and / are close 
as functions over the Boolean cube. This implies that / behaves similarly on the slice, on the Boolean cube, 
and on Gaussian space. 

Our proof combines algebraic, geometric and analytic ideas. A coupling argument, which crucially relies 
on properties of harmonic functions, shows that the distribution of a low-degree, low-influence harmonic 
function / is approximately invariant when we move from the original slice to nearby slices. Taken together, 
these slices form a thin layer around the original slice, on which / has roughly the same distribution as on 
the original slice. The classical invariance principle implies that the distribution of / on the layer is close to 
its distribution on the Gaussian counterpart of the layer, which turns out to be identical to its distribution 
on all of Gaussian space, completing the proof. 

A special case of our main result can be stated as follows. 

Theorem 1.1. For every e > 0 and integer d ^ 0 there exists t = r(e, d) > 0 such that the following holds. 
Let n > 1/r, and let f be a harmonic multilinear polynomial of degree d such that with respect to the uniform 
measure Vpn on the sliee (^^), the varianee of f is at most 1 and all influences of f are bounded by r. 

The CDF distance between the distribution of f on the slice Vpn and the distribution of f under the 
product measure jip with marginals Ber(p) is at most e: for all ct e R, 

I Pr[/ < cr] - Pr[/ < cr]| < e. 

^pn 

This result is proved in Section [^1^ 

Subsequent to this work, the first and third author came up with an alternative proof of Theorem ll.il |25j 
which doesn’t require the influences of / to be bounded. The proof is completely different, connecting the 
measures Pp and Vpn directly without recourse to Gaussian space. While the main result of [5^ subsumes 
the main result of this paper, we believe that both approaches have merit. Furthermore, the applications of 
the invariance principle appearing here are not reproduced in [25]. 

Paper organization An overview of our main results and methods appears in Section |5| Some prelimi¬ 
naries are described in Section |31 We examine harmonic multilinear polynomials in Section |3| We prove the 
invariance principle in Section |5| Section |6| proves Majority is Stablest, and Section |3 proves Bourgain’s tail 
bound, two applications of the main invariance principle. Section |8| deduces a version of the Kindler-Safra 
theorem from Bourgain’s tail bound. Our stability result for t-intersecting families appears in Section |9l 
Some open problems are described in Section fTOl 

2 Overview 

The goal of this section is to provide an overview of the results proved in this paper and the methods 
used to prove them. It is organized as follows. Some necessary basic definitions appear in Subsection 12.11 
The invariance principle, its proof, and some standard consequences are described in Subsection 12.21 Some 
applications of the invariance principle appear in Subsection [231 versions of Majority is stablest, Bourgain’s 
theorem, and the Kindler-Safra theorem for the slice. An application of the Kindler-Safra theorem to 
extremal combinatorics is described in Subsection 12.41 Finally, Subsection 12.51 presents results for non¬ 
harmonic multilinear polynomials. 
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2.1 Basic definitions 

Measures Our work involves three main probability measures, parametrized by an integer n and a prob¬ 
ability p G (0,1): 

• pLp is the product distribution supported on the Boolean cube {0,1}" given by Pp{S) = 

• i/pn is the uniform distribution on the slice = {{xi,... ,Xn) e {0,1}" : xi + ■ ■ ■ + Xn = pn} (we 
assume pn is an integer). 

• Qp is the Gaussian product distribution N((p,... ,p),p(l — p)/„) on Gaussian space K". 

We denote by H/Utt the L2 norm of the polynomial / with respect to the measure tt. 


Harmonic polynomials As stated in the introduction, we cannot expect an invariance principle to hold 
for all multilinear polynomials, since for example the polynomial xi + ■ ■ ■ + Xn — pn vanishes on the slice but 
not on the Boolean cube or on Gaussian space. We therefore restrict our attention to harmonic multilinear 
polynomials, which are multilinear polynomials / satisfying the differential equation 


E 

2=1 


dxi 


= 0 . 


(The name harmonic, whose common meaning is different, was lifted from the literature.) 

Dunkl [HJ [TU] showed that every function on the slice has a unique representation as a harmonic 
multilinear polynomial whose degree is at most min(pn, (1 —p)n). This is the analog of the well-known fact 
that every function on the Boolean cube has a unique representation as a multilinear polynomial. 

One crucial property of low-degree harmonic multilinear polynomials is invariance of their L2 norm: for 
any p ^ 1/2 and any harmonic multilinear polynomial / of degree d ^ pn, 

This is proved in Filmus |13] . and in fact this result (and its applications in the present work) was the main 
motivation for m- 


Influences The classical definition of influence for a function / on the Boolean cube goes as follows. Define 
/W(a;) = /(a;W), where a;W results from flipping the fth coordinate of x. The ith cwfee-influence of / is given 

by 


Inf/[/] = ||/-/W||2^ = 


df 


dxi 


1 


p(l - p) 




Mp ^i€S 

This notion doesn’t make sense for functions on the slice, since the slice is not closed under flipping 
of a single coordinate. Instead, we consider what happens when two coordinates are swapped. Define 
/(b)( 2 ;) = /(xbj)), where results from swapping the zth and jth coordinates of x. The (i,/)th slice- 
influence of / is given by 

Inf^[/]= E[(/-/(b))2]. 

^pn 

The influence of a single coordinate i is then defined as 

1 


Initlf] = - ^ Inf^[/]. 


i=i 

The two definitions are related: Lemma [5^ shows that if d = 0{-s/n) then 

Inf«[/] = Op ("-¥[/] +Inf*[/] 
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(The variance can be taken with respect to either the Boolean cube or the slice, due to the L2 invariance 
property.) 

Noise stability The classical definition of noise stability for a function / on the Boolean cube goes as 
follows: 

§^[/]=E[/(x)/(y)], 

where x ~ jip and y is obtained from x by letting yi = Xi with probability p, and yi ~ Pp otherwise. 

The analogous definition on the slice is slightly more complicated. For a function / on the slice, 

S;[f] = E[/(a:)/(y)], 

where x ~ Vpn and y is obtained from x by doing Po(^^^^logi) random transpositions (here Po(A) is 
a Poisson distribution with mean A). That this definition is the correct analog can be seen through the 
spectral lens: 

Sp[/] = §p[/] = 

d d 

Here is the dth homogeneous part of / consisting of all monomials of degree d. 


2.2 Invariance principle 

Our main theorem is an invariance principle for the slice. 

Theorem 15.81 Let / be a harmonic multilinear polynomial of degree d such that with respect to 
V[/] ^ 1 and Inf®[/] ^ r for all i e [n]. Suppose that r ^ and n ^ IpIS^, for some constants Ip,K. 

For any C-Lipschitz functional tp and for tt e {f/p,Pp}, 

I E[Vz(/)]-E[Vz(/)]| = Op(C<5). 

l^pn ^ 


Proof sketch. Let ip he & Lipschitz functional and / a harmonic multilinear polynomial of unit variance, 
low slice-influences, and low degree d. A simple argument (mentioned above) shows that / also has low 
cube-influences, and this implies that 


E[Vz(/)]« E[Vz(/)]±Op 


\k — np\ 


■v~d\. 


In 


The idea is now to apply the multidimensional invariance principle jointly to / and to S' = 

V'pll-pln 

deducing 

E[z/-(/)l|S|^<,] = E[V-(/)l|s|=;.]±e. 

Mp »p 

Let ^p,q be the restriction of Qp to the Gaussian slice {(xi,... ,Xn) e R” : xi + ■ ■ ■ + Xn = qn}. An easy 
argument shows that since / is harmonic, the distribution of f{Gp) and f{'^p,q) is identical, and so 

E[Vz(/)l|s|^.] = Pr[|S|<a]E[Vz(/)]. 

t/p Vp »p 


Similarly, 

Since Pr 0 p[|S| ^ cr] « Prpp[|S| ^ cr] = 0p(cr), we can conclude that 


E[z/-(/)l|S|^.] = Pr[|S| cr]{E[iP{f)] ± Op{aVd)). 

f-Lp fJjp fJip 


E[iPif)]^E[iPif)]±Op(aVd+-). 

Vprv Gp V (T / 

By choosing tr appropriately, we balance the two errors and obtain our invariance principle. 


□ 
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As corollaries, we bound the Levy and CDF distances between f{vpn), and f{Gp)- 

Corollary 15.91 Let / be a harmonic multilinear polynomial of degree d such that with respect to i^pn, 
V[/] ^ 1 and Inf*[/] ^ r for all i e \n\. There are parameters Xp,X such that for any 0 < e < 1/2, if 
T ^ X~^e^ and n > X^/e^ then the Levy distance between f(i^pn) and /(tt) is at most e, for tt e {Gp,fip}. 
In other words, for all a, 


Pr[/ ^ CT - e] - e ^ Pr[/ ^ ct] ^ Pr[/ ^ cr + e] + e. 

I'pn tt l^pn 

Corollary I5.11L Let / be a harmonic multilinear polynomial of degree d such that with respect to 
V[/] = 1 and Inf*[/] ^ r for all i e [n]. There are parameters Yp,Y such that for any 0 < e < 1/2, if 
T ^ {Ypd)~^e^'^ and n > {Ypd)^/e^‘^ then the CDF distance between fi^pn) and /(tt) is at most e, for 
TT e {t/p, /ip}. In other words, for all a, 


I Pr[/ sg cr] - Pr[/ cr]| sg e. 

l^pn TT 

The proofs of these corollaries closely follows the proof of the analogous results in 


2.3 Applications 

As applications to our invariance principle, we prove analogues of three classical results in analysis of Boolean 
functions: Majority is stablest; Bourgain’s theorem; and the Kindler-Safra theorem: 

Theorem 16.31 Let /: ^ [0,1] have expectation /r and satisfy Inf*[/] ^ r for all i e [n]. For any 

0 < p < 1, we have 


Sp[/] rp(p) + Op, 


/log log; 


V logi 


+ Op ( - ) , where a = min(r, 


where rp(/i) is the probability that two p-correlated Gaussians be at most $ ^(p) (here $ is the CDF of a 
standard Gaussian). 

Theorem 17.21 Fix k > 2. Let /: ^ {+1} satisfy Inf®[/*®^] ^ r for all z e [n]. For some constants 

lFp,fe,0, if T ^ and n > Wp,fc/V[/]‘^ then 

Theorem 18.51 Fix the parameter fc > 2. Let /: ^ { + 1} satisfy ||/^^P = £■ There exists a function 

h: (^’(1) ^ {+1} of degree k depending on Ok,p{l) coordinates (that is, invariant under permutations of all 
other coordinates) such that 

||/-hp = Op,fc 


7 I/C 


for some constant C. 

The proof of Theorem 16.31 closely follows its proof in [55]. The proofs of the other two theorems closely 
follows analogous proofs in ED- 


2.4 t-Intersecting families 

As an application of our Kindler-Safra theorem, we prove a stability result for t-intersecting families. 
First, a few definitions: 
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• A t-intersecting family T c is one in which \A n B\ ^ t for any A,B e 

• A f-star is a family of the form {A e : A 3 J}, where | J| = t. 

• A (t, 1)-Frankl family is a family of the form {A e : |A n J| > t + 1}, where \J\ = t + 2. 

Ahlswede and Khachatrian mm proved that if n > (t + l)(fc — t + 1) and T is an intersecting family, 
then |J^| sg and furthermore equality holds if and only if is a t-star. They also proved that when 

n = {t + l){k — t + 1) the same upper bound holds, but now equality holds for both t-stars and {t, 1)-Frankl 
families. 

A corresponding stability result was proved by Friedgut m-- 

Theorem 19.21 Let t > 1, fc ^ t. A, C > 0, and Xn < k < (— C,)n. Suppose T 3 is a t-intersecting 
family of measure \T\ = (^Zj) ~ Then there exists a family Q which is a t-star such that 


l-^Ag| 

C) 


- Ot,A.c(e)- 


Friedgut’s theorem requires k/n to be bounded away from l/(t + 1). Using the Kindler-Safra theorem 
on the slice rather than the Kindler-Safra theorem on the Boolean cube (which is what Friedgut uses), we 
can do away with this limitation: 


Theorem 19.31 Let f>2, and n = {t + l){k — t + 1) + r, where r > 0. Suppose that k/n > A for 

some A > 0. Suppose T 3 is a t-intersecting family of measure IJ^I = (^Z*) ~ Then there exists a 
family Q which is a t-star or a {t, 1)-Frankl family such that 


l-^Ag| 

il) 


= o, 


t,X 


^max 





for some constant C. 

Furthermore, there is a constant At^\ such that e ^ At^\rauY{r/k, 1)'^+^ implies that is a t-star. 


Our proof closely follows the argument of Friedgut transplanting it from the setting of the Boolean 
cube to the setting of the slice, using calculations of Wilson [32] in the latter setting. The argument involves 
certain subtelties peculiar to the slice. 


2.5 Non-harmonic fnnctions 


All results we have described so far apply only to harmonic multilinear polynomials. We mentioned that some 
of these results trivially don’t hold for some non-harmonic multilinear polynomials: for example, Xi — np 

doesn’t exhibit invariance. This counterexample, however, is a function depending on all coordinates. In 
contrast, we can show that some sort of invariance does apply for general multilinear polynomials that 
depend on a small number of coordinates: 

Theorem 14.31 Let / be a multilinear polynomial depending on d variables, and let / be the unique harmonic 
multilinear polynomial agreeing with / on where d ^ pn ^ n/2. For tt g {pp, Qp} we have 

Proof sketch. Direct calculation (appearing in Lemma |4.2I1 shows that if w is a Fourier character than 


IIW-WII^^ = ||W' 


= O 


p{l — p)n 
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where w is defined analogously to /. 

We can assume without loss of generality that / depends only on the variables in [d] = {1,..., d}. Since 


/ fWl ^ Ksfo o 


j2nd 


using the Cauchy-Schwartz inequality. □ 

The idea of the proof is to prove a similar results for Fourier characters (Lemma 14.2p for individual 
Fourier characters, and then to invoke the Cauchy-Schwartz inequality. 

As a consequence, if we have a multilinear polynomial / depending on a small number of variables, its 
harmonic projection / (defined as in the theorem) has a similar expectation, L2 norm, variance and noise 
stability fCorollary 14.41) . This implies, for example, that our Majority is stablest theorem is tight: the 
harmonic projection of the majority of a small number of indices serves as the tight example. 


3 Preliminaries 

Notation The notation l^; is the characteristic function of the event E. Expectation, variance and covari¬ 
ance are denoted by E, V and Cov, respectively. The sign function is denoted sgn. The notation [n] denotes 
the set {!,..., n}. The slice (^)(^) consists of all subsets of [n] of cardinality k. We often identify subsets of 
[n] with their characteristic vectors in {0,1}". 

The notation Bin(n,p) denotes a binomial distribution with n trials and success probability p. The 
notation Po(A) denotes a Poisson distribution with expectation A. The notation N(/i, E^) denotes a normal 
distribution with mean /i and covariance matrix E^. For a scalar p, we use p to denote a constant p vector 
(of appropriate dimension which is clear from context) and to denote the n x n identity matrix. 

For a probability distribution tt, ||/|| = ||/||,n. = yE))])/^ is the L2 norm of / with respect to tt. Note that 

ll/lli = E[|/|]- 

The notation a- denotes the falling factorial function: a- = a(a — 1) (a — b + 1). 

Asymptotic notation (O(-) and the like) will always denote non-negative expressions. When the expression 
can be positive or negative, we use the notation ±0(-). The underlying limit is always n —> co. If the hidden 
constant depends on variables V, we use the notation Ov(-). 

A C-Lipschitz functional is a function '0: K. —> K satisfying \'tp{x) — ■i/’(y)| ^ C\x — yj, which implies that 
for functions /, g on the same domain: 

Lemma 3.1. For every C-Lipschitz functional if and functions f,g on the same domain, 

m{f)]-E[fz{g)]\^C\\f-g\\. 

Probability distributions Our argument will involve several different probability distributions on R." 
(where n will always be clear from context): 

• pp is the product distribution supported on {0,1}" given by Pp{S) = pl‘^l(l — p)”“l'®l. 

• izk is the uniform distribution on the slice 

• Qp is the Gaussian product distribution N((p,... ,p),p(l — p)/„). 

• 7 p,, = N((q,... ,g),E), where Ei,^ = ^p{l - p)6{i = j) - j) for 1 ^ i,j ^ n. 

As is well-known, the distribution 7 p_g results from conditioning Qp on the sum being qn. 

Lemma 3.2. Let {Xi ,..., Xn) ~ Qp. The distribution of (ATi,..., X„) conditioned on Xi Xn = qn 

is Jp^q. 
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Proof. Let S = Xi + • • • + and consider the multivariate Gaussian distribution {Xi ,..., S), whose 

distribution is easily calculated to be N((p pn) ,p{l —p) )• Let (Yi,..., ¥„) be the distribution of 

(Xi,..., Xn, S) conditioned on 5" = qn, which is well-known to be multivariate Gaussian. Using well-known 
formulas, the mean of this distribution is p -I- \n~^{qn — pn) = q (as can be derived directly), and its 
covariance matrix is p{l — p){In — ln“^l'). The diagonal elements are V[Yi] = p{l — p){l — L) and the 
off-diagonal ones are Cov{Yi,Yj) = p(l — p)(—L). □ 

We can also go in the other direction. 

Lemma 3.3. Let {Xi ,..., Xn) ~ "fp.q, Y ~ N(p—g, and let Yi = Xi + Y. Then {Yi ,..., Y„) ~ Qp. 

Proof. As is well-known, Yi,..., Y„ is a multivariate Gaussian, and it is easy to see that its mean is p. We 
have V[Yi] = V[AJ + V[Y] = p(l — p) and Gov(Yi, Yj) = Cov(Ai, Xj) + V[Y] = 0. The lemma follows. □ 

The distributions pp and nk are very close for events depending on o{'\/ri) coordinates. 

Lemma 3.4. Let A he an event depending on J coordinates, where ^ n. Then 

Wpn{A) - Pp{A)\ ^ 


Proof. The triangle inequality shows that we can assume that A is the event Xi 
1 for some £. Let k = pn. Glearly Pp{A) = (1 — p)^p'^~^, whereas 


^pn (A) 


(n - k)^kd^ 


X£ 0, ..., xj 


We have 


Therefore 


\ n 


J-1 




n 


^pn (A) ^ 


(n — kYk 


j-i 


n-^{l — J^/{2n)) 


< 


Pp{A) ( 1 -f 


using ^ 1 4- 2x, which is valid for x ^ 1/2. 
Similarly, 


JY_ 

2n 


(n - k)^k^ ^ e (J - ^ - 1 

{n — k)^k^ '' 2{1 — p)n 2pn '' \2{1 — p)n'2pn) ^ Ap{l — p)n' 


Therefore 




n'- 


4p(l — p)n 


This completes the proof. 


□ 


3.1 Harmonic multilinear polynomials 

Our argument involves extending a function over a slice to a function on R", just as in the classical 
invariance principle, a function on {0,1}” is extended to R" by writing it as a multilinear polynomial. In our 
case, the correct way of extending a function over a slice to R” is by interpreting it as a harmonic multilinear 
polynomial. Our presentation follows m, where the proofs of various results claimed in this section can be 
found. The basis in Definition O below also appears in earlier work of Srinivasan [30] . who constructed it 
and showed that it is orthogonal with respect to all exchangeable measures. 















Definition 3.5. Let / e R[a:i,... ,a:„] be a formal polynomial. We say that / is multilinear if 
all i e [n]. We say that / is harmonic if 




dx^ 


= 0 for 


The somewhat mysterious condition of harmonicity arises naturally from the representation theory of the 
Johnson association scheme. Just as any function on the Boolean cube {0,1}” can be represented uniquely 
as a multilinear polynomial (up to an affine transformation, this is just the Fourier-Walsh expansion), every 
function on the slice can be represented uniquely as a harmonic multilinear polynomial, using the 

identification 

={(xi,...,x„)e{0,ir:|;x. = fc}. 

' ' i=^ 


Lemma 3.6 f |131 Theorem 4.1]). Every real-valued function f on the slice can be represented uniquely 
as a harmonic multilinear polynomial of degree at most min(fc,n — k). 


There is a non-canonical Fourier expansion defined for harmonic multilinear polynomials. 

Definition 3.7. Let A = (oi,..., Od) and B = (6i,..., fe^) be two sequences of some common length d of 
distinct elements of [n]. We say that A < B A. 

(a) A and B are disjoint. 

(b) B is monotone increasing: bi < ■ ■ ■ < bd- 

(c) Oi < bi for all i e [d]. 

A sequence B = {bi,... ,bd) is a. top set ii A < B for some sequence A. The collection of all top sets of 
length d is denoted Bn,d, and the collection of all top sets is denoted 
If A = (oi,..., Od) and B = {bi,... ,bd) satisfy A < B, define 


d 

XA,B = - Xbi). 

i=l 


For a top set B, define 


Finally, define 


XB = Yi ^AB- 

A<B 


Xd — X{2,4....,2d}- 


Lemma 3.8 f [131 Theorem 3.1,Theorem 3.2]). Let tt be any exchangeable distribution on R" (that is, tt 
is invariant under permutation of the coordinates). The collection Bn forms an orthogonal basis for all 
harmonic multilinear polynomials in ]R[a:i,... ,Xn\ (with respect to tt), and 


WxbWI = cb||x|b|||^, where cb 


1=1/6. - 2(. - in 

2=1 ' ' 


and IIxbIItt denotes the norm of xb with respect to tt. 

In particular, if f is a harmonic multilinear polynomial then E[/] is the same under all exchangeable 
measures. 

Lemma 13.31 and Lemma 13.81 put together have the surprising consequence that harmonic multilinear 
functions have exactly the same distribution under Qp and Xp,q- 
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Lemma 3.9. Let f be a harmonic multilinear polynomial. The random variables f{Gp), f{lp,q) are identically 
distributed. 

Proof. According to Lemma |3.31 if (xi,...,Xn) ~ 7p,g, y ~ N(g — p, ) and yt = Xi + y, then 
{yi,-- ■ ,yn) ~ Gp- The lemma follows since yi — yj = Xi — Xj and a harmonic multilinear polynomial 
can be expressed as a function of the differences Xi — Xj for all i, j. □ 

Lemma [3.8l allows us to compare the norms of a harmonic multilinear function under various distributions. 

Corollary 3.10. Let 7ri,7r2 be two exchangeable distributions on R", and let f be a harmonic multilinear 
polynomial of degree d. If for some e > 0 and allO ^ d it holds that {l — e)\\xe\\'i^ < IIXe ||^2 ^ (l + £)IIXe||wi> 
then also (1 - e)\\f\\l^ ^ \\f\\l^ ^ (1 + e)\\f\\l^. 

The following lemma records the norms of basis elements for the distributions considered in this paper. 
Lemma 3.11. For all d we have 




IIXdL„„ — 2 


^{pn)H{l-p)n)i 


^2d 


= {2p{l-p))^{l±0 


d^ 


p{l-p)n 


Proof. The exact formulas for ||xd||^p and are taken from [131 Theorem 4.1]. Since Xi,...,Xn are 

independent under Gp, we have ||xd||pp = ]E[(a;i — X2)^Y = (2p(l — p)Y- 

It remains to prove the estimate for ||xd||^p„- The proof of [T31 Theorem 4.1] shows that 




fi-^) 

Mp) 


It follows that 


llXdlli 


{2p{l-p)Y 




pn (1 — p)n n 


+^|.i±o 


p{l-p)nj ■ 


□ 


Lemma 13.111 and Corollary 13.101 imply an L2 invariance principle for low degree harmonic multilinear 
polynomials. 


Corollary 3.12. Suppose f is a harmonic multilinear polynomial of degree d onn variables. For any p ^ 1/2 
such that d ^ pn and any ir e {/ip, Gp} we have 


WfU 


Wfl 


1 + 0 


TT -L -L 


d^ 


p{l-p)n 


3.2 Analysis of functions 

We consider functions on three different kinds of domains: the Boolean cube {0,1}", the slice and 

Gaussian space R.". We can view a multilinear polynomial in R[a:i,..., Xn} as a function over each of these 
domains in the natural way. 

For each of these domains, we proceed to define certain notions and state some basic results. The material 
for the Boolean cube and Gaussian space is standard, and can be found for example in [28] . 
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Functions on the Boolean cube The Boolean cube is analyzed using the measure fip for an appropriate 
p. The Fourier characters ujs and Fourier expansion of a function / are given by 


We define = l]| 5 |=/c f{S)u}s, and so a multilinear polynomial / of degree d can be decomposed as 
/ = /^°+- • Since the Fourier characters are orthogonal, the parts ..., are orthogonal. In the 

future it will be convenient to separate / into / = + for an appropriate k, where + - • ■ + f^^ 


>k £ = k-\-l 


and />'' = / 


+ r 


Define f^^\x) = /(a;W), where a;W results from flipping the ith coordinate of x. The ith cube-influence 
is given by 


Inf ■ [/] = 11/ - / 


W ||2 _ 


df 


^x^ 




The total influence of / is Inf‘^[/] = Yh=i Infi[/]) and it satisfies the Poincare inequality 

V[/] < p(l - p) Inf^[/] ^ (deg /) V[/]. 

The noise operator Tp is defined by 


The noise stability of / at p is 


deg/ 

Tpf = 2 p^n. 

2 = 0 

deg/ 

%[f]={f,Tpf)= Y, pV 

i=0 


=i||2 


The noise operator (and so noise stability) can also be defined non-spectrally. We have {Tpf){x) = E[/(?/)], 
where y is obtained from x by letting pi = Xi with probability p, and pi ~ p,p otherwise. 


Functions on the slice The slice (f^^) is analyzed using the measure Vk- The corresponding notion 
of Fourier expansion was described in Section 13.11 A harmonic multilinear polynomial / of degree d can 
be decomposed as / = -/••• + f^‘^, where contains the homogeneous degree k part. The parts 

..., are orthogonal. 

The (i,/)th influence of a function / is Inf®^//] = E[(/ — where f^''^\x) = f{x^’'^'>), and 

is obtained from x by swapping the ith and jth coordinates. We define the ith influence by Inf^[/] = 
n 2j=i Iiifij//], and the total influence by Inf®[/] = YIi=i laf® [/]. The total influence satisfies the Poincare 
inequality 

V[/]«:Infl/]^(deg/)V[/]. 

For a proof, see for example m Lemma 5.6]. 

The noise operator Hp is defined by 

deg/ 

Hpf = Y 

d=0 

The noise stability of / at p is 

deg/ 

d=0 

The noise operator (and so noise stability) can also be defined non-spectrally. We have {Hpf){x) = 
E[/(?/)], where y is obtained from x by taking Po(^2^ log i) random transpositions. 
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Functions on Gaussian space Gaussian space is R" under a measure Qp for an appropriate p. In this 
paper, we mostly consider functions on K” given by multilinear polynomials, and these can be expanded 
in terms of the wg. General functions can be expanded in terms of Hermite functions. Every square- 
integrable function can be written as / = where satisfies + p,... ,axn + p) = 

a'=/='=(xi +p,...,Xn+p)- 

The distributions Pp and Gp have the same first two moments, and this implies that [/] = [/] and 

II/IImp = II/IIGp foi' every multilinear polynomial /. The Ornstein-Uhlenbeck operator Up is defined just like 
Tp is defined for the cube. Noise stability is defined just like in the case of the cube, and we use the same 
notation S° for it. 

The noise operator (and so noise stability) can also be defined non-spectrally. We have (C/p/) (a;) = 
E[/(?/)], where y = {1 — p)p + px + -y/l — N(0,p(l — p)). We can also define noise stability as S);[/] = 

E[/(a:)/(j/)], where {x,y) ~ N((p,p), 


Homogeneous parts For a function /, we have defined / ^ in three different ways, depending on the 
domain. When / is a harmonic multilinear polynomial, all three definitions coincide. Indeed, any harmonic 
multilinear polynomial is a linear combination of functions of the form xa,b- We show that xa,b = Xab' 
under all three definitions. Let A = ai,... ,ak and B = bi,... ,bk. Since xa,b is homogeneous of degree k 
as a polynomial, we see that xa,b = Xa’^b over the slice. Also, 


XA,B 


ipii-p))>^/^fl 

i=l 


Xg, -P 

Vp( 1 -p) 


Xbj-P \ 


Opening the product into a sum of terms, we can identify each term with a basis function ujs for some S of 
size k. This shows that xa,b = Xa% over the cube. Finally, since xa,b is harmonic, in order to show that 
XA,B = Xa'^b Gaussian space, it suffices to show that xa,b{<ax) = a'^XA,B{x), which is true since xa,b is 
homogeneous of degree fc as a polynomial. 


Degrees The following results state several ways in which degree for functions on the slice behaves as 
expected. 

First, we show that degree is subadditive. 

Lemma 3.13. Let f,g be harmonic multilinear polynomials, and let h be the unique harmonic multilinear 
polynomial agreeing with fg on the slice Then degh ^ deg / + degp. 

Proof. We can assume that deg / + degp ^ k, since otherwise the result is trivial. 

Let Ei be the operator mapping a function (f> on the slice to the function on the slice. That is, we take 
the harmonic multilinear representation of cj), extract the z’th homogeneous part, and interpret the result as 
a function on the slice. Also, let E^d = 2^=0 ^ function (p on the slice has degree at most d if and only 

if it is in the range of E^d- 

Qiu and Zhan [29] (see also Tanaka [31]) show that fg is in the range of E<^degf o E^degg, where o is 
the Hadamard product. The operators Ei are the primitive idempotents of the Johnson association scheme 
(see, for example, [U §3.2]). Since the Johnson association scheme is Q-polynomial (cometric), the range of 
E^degf o E^degg equals the range of E^degf+degg, and so deg fg ^ deg/ + degp. □ 

As a corollary, we show that “harmonic projection” doesn’t increase the degree. 

Corollary 3.14. Let f be a multilinear polynomial, and let g be the unique harmonic multilinear polynomial 
agreeing with f on the slice Then degg ^ deg/. 
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Proof. When f = xi, one checks that g is given by the linear polynomial 


1 " 

g = - 'yXxi - x^) + 

n 


i=l 


The corollary now follows from Lemma HS.Inl and from the easy observation deg{aF+PG) ^ max(deg F, deg G). 

□ 


An immediate corollary is that degree is substitution-monotone. 

Corollary 3.15. Let f be a harmonic multilinear polynomial, let g(xi,...,Xn) = 

f(xi,...,Xn-i,b) for b G {0,1}, and let h be the unique harmonic multilinear polynomial agreeing with 
g on the slice Then degh ^ deg/. 

Noise operators We have considered two noise operators, Hp and Tp = Up. Both can be applied syntac¬ 
tically on all multilinear polynomials. The following result shows that both operators behave the same from 
the point of view of Lipschitz functions. 

Lemma 3.16. Let f be a multilinear polynomial of degree at most n/2. For 6 < 1/2 and any G-Lipschitz 
functional and with respect to any exchangeable measure, 

miH,_sf)] - E[V.(t/i-^/)]| = O (^ll/ll) • 

Proof. Let p = 1 — 6. Lemma 13 .1 1 shows that 

n/2 

m{Hpf)]-E[f,{Upf]\^ ^ C^\\Hpf-Upff = ^ - p<^f\\f=X- 

d=0 

Let R{x) = p^. Then p‘^U-i‘i-i)/n) _ pd ^ (^x)) for some x G [d(l — {d — l)/n),d]. For such x, 

R'{x) = p^{—\ogp) ^ ^ p'^/‘^{25), using <5 < 1/2 and d ^ n/2. Therefore 

pdp-id-l)/n) _ pd ^ 25 ^^^ ~ 

n 

The expansion x^/(l—a;)^ = X;”=o ( 2 )®^ implies that d{d—l)p'^/'^ ^ 2p/(l —Since 1 —y/i = 1 —Vl — 5 > 
5 / 2 , we conclude that 

pd(i-(d-i)/n) _ pd ^ 32 ^ 

n 

The lemma follows. □ 


4 On harmonicity 


Let / be a function on the Boolean cube {0,1}", and let / be the unique harmonic function agreeing with / 
on the slice We call / the harmonic projection of / with respect to the slice In this section we 

prove Theorem 14.31 which shows that when / depends on (1 — e) logn variables, it is close to its harmonic 
projection under the measure p.p. Together with Corollary 13. 121 this allows us to deduce properties of / on 
the slice given properties of / on the Boolean cube, an idea formalized in Corollary 14.41 
We start by examining single monomials. 


Lemma 4.1. Let m be a monomial of degree d, and let f be the unique harmonic multilinear polynomial 
agreeing with m on (where d ^ k ^ x),/2). Then deg / = d and the coefficient Cm of m in f is 


n — 2d+ 1 
n — d + 1 


1-0 
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Proof. Without loss of generality we can assume that m = Xn-d+i • • • Xn- Let B = {n — d+ 1,... ,n}. Recall 
that the basis element xb is equal to 

XB = ^ {Xaj - Xn-d+l) ■ ■ ■ {Xa^ - x„). 


Let / be the unique harmonic multilinear polynomial agreeing with m on Corollary 13.141 shows 

that deg / ^ d. The coefhcient f{B) of xb in the Fourier expansion of / is given by the formula f{B) = 
{/j X-b)/IIxsP- Since deg / ^ d, it is not hard to check that in the Fourier expansion of /, the monomial m 
only appears in xb- Therefore the coefficient Cm of to in / is 

llxslr 

since there are (n — d)- summands in the definition of xb■ The value of || xb ||^ is given by Lemma 13.81 and 
Lemma 13.111 

2 fn — d+l\fn — d\ /n — 2d + 2\ ^k-{n — k)- 

= (, 2 2 j-'A 2 P nM • 

We proceed to compute <(/, xb}- Let S e . If /(S')xb(S') ^ 0 then B S, which happens with probability 
k-fn-. The number of non-zero terms (each equal to (—1)'^) is the number of choices of oi,..., / S, namely 

(n - k)^. Therefore </,xb> = (-l)‘^fc^^(n - k)^/n^, and so 


, , ^ k-{n — k)- n— 

™ n- {n — d + \){n — dY • • • (n — 2(i + 2Y{n — 2d+ l)/c^(n — k)d. 

{n — d)-{n — d)- 

{n — d + l)(n — dY • • • (n — 2d -f 2Y{n — 2d+l) 
n — 2d+ 1 
n — d + \ 


Finally, since 7 ^ 0 and deg f ^ d, we can conclude that deg f = d. 


□ 


As a consequence, we obtain a result on Fourier characters on the cube. 


Lemma 4.2. Let co = los be a Fourier character with respect to the measure fip of degree d, and let co be the 
unique haruionic multilinear polynomial agreeing with uj on (f^) (where d ^ np ^ xil2). For tt g {p,p,Qp} 
we have 

||a;-a;||^ = o(^(l_^)^). 

Proof. Recall that 

Lemma |4T] shows that 

where rj involves other monomials. In fact, since w is harmonic, it is invariant under shifting all the variables 
by p, and so 

Ul = CU! + T]', 

where rj' involves other characters. Due to orthogonality of characters we have 

\\dJ-u:t= ll^ll^ - (2c - l)||a;||2 = ||w||^ _ (2c - 1) = ||w||^ - 1 + O (^0 . 
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Since Cj is harmonic, Corollarv l3.12l allows us to estimate ||w||^ given which we proceed to estimate: 

1 


[p[l - p)Y J nd 




d\ k^{n-k)<LA ^^ p)2y(d-«) 


1 


{p{i-p)Y £ 
= 1 + 0 


2 (^1 + 0 


t=0 


p(l -_p)n 


vp(i -p)ny ■ 

Corollary 13.121 shows that the same estimate holds even with respect to tt, and so 

Cp \ „ / d2 


11 + - LoWl = WCjWI -1 + 0 


p{l-p)n 


= O 


p{l-p)n) ■ 


□ 


We can now conclude that a multilinear polynomial depending on a small number of variables is close to 
its harmonic projection. 

Theorem 4.3. Let f be a multilinear polynomial depending on d variables, and let f be the unique harmonie 
multilinear polynomial agreeing with f on (^), where d ^ pn ^ n/2. For tt e {pip,Qp} we have 


\\f-f\\l = o 


i2nd 


d^2 


p{l-p)n 


ml- 


Proof. We can assume without loss of generality that / depends on the first d coordinates. Express / as a 
linear combination of characters: / = X;S(=[d] Clearly / = Xisc[(i] where +s is the unique 

function agreeing with ujs on Lemma 14.21 together with the Cauchy-Schwartz inequality shows that 

u-fX < (^) ^ o (££;) i/ie. 


This completes the proof. □ 

Combining Theorem 14.31 with Corollary 13.121 we show how to deduce properties of / on the slice given 
its properties on the cube. 

Corollary 4.4. Let f be a multilinear polynomial depending on d variables, and let f be the unique harmonic 
multilinear polynomial agreeing with f on where d ^ pn ^ nj^. Suppose that ||/||£ = \\f\\g^ ^ 1- For 

TT e {pp,Qp} we have: 

1. |E,[/]-E.^„[/]| = Op(£^). 

\\f\Y^ = l±Op{^). 

3 - V[/].,„ = V[/]. + Op(£^). 

4. For all P e [0,1], = S=[/]. + Op{^). 

5. For all i d, = \\f=^\\p^ ± Op{^). 
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Proof. Throughout the proof, we are using Corollary 13.121 to convert information on / with respect to tt to 
information on / with respect to Vpn. All calculations below are with respect to tt. 

For the first item, note that 

I E[/] - E[/]| ^ 1/ - /111 < 1/ - /II2 = Op . 

The second item follows from the triangle inequality 

ll/ll -II/-/NII/N ll/ll+ 11/- /II• 

For the third item, notice first that |E[/]| ^ ||/||i ^ ||/||2 = 1. The item now follows from the previous 
two. 

The fourth item follows from the fact that is 1-Lipschitz, which in turn follows from the fact that 
^p[/] = 11^that is a contraction. 

For the fifth item, assume that / depends on the first d variables, and write / = X;Sc[d] ^s^s- We have 

= XI = /=^ + X 

SQ[d] |S|>f 

Lemma 1321 shows that for IS”! > £, ||(d)s)^^p < ||a ;5 — wsp = Op(^). Therefore 

The fifth item now follows from the triangle inequality and the second item. □ 

5 Invariance principle 

In the sequel, we assume that parameters p e (0,1/2] and n such that pn is an integer are given. The 
assumption p ^ 1/2 is without loss of generality. 

We will use big O notation in the following way: / = Op{g) if for all n > N{p), it holds that / ^ C{p)g, 
where N{p),C{p) are continuous in p. In particular, for any choice of pl,Ph satisfying 0 < pl < Ph < 1, 
if p e [pl,Ph\ then / = 0{g). Stated differently, as long as A ^ p ^ 1 — A, we have a uniform estimate 
/ = 0\ (p). Similarly, all constants depending on p (they will be of the form Ap for various letters A) depend 
continuously on p. 


Proof sketch Let 

low slice-influences, 
this implies that 


tjj he a, Lipschitz functional and / a harmonic multilinear polynomial of unit variance, 
and low degree d. A simple argument shows that / also has low cube-influences, and 

E[V^(/)] « E [fj{f)] ± Op . Vd") . 


The idea is now to apply the multidimensional invariance principle jointly to / and to S 
deducing 

E[^(/)l|S|^,] = E[V-(/)l|s|=;.]±e. 

Mp »p 

An application of Lemma 13.91 shows that 


ail H-l~^n —np 

■\/p(l-p)n ’ 


E[W)l|S|=s.] 

Gp 


Pr[|^|s:a]E[V-(/)]. 
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Similarly, 


E[^(/)l|S|^a] = Pr[|5| ^ a](E[V^(/)] ± Oj,{aVd)). 

f-Lp fJjp jJjp 

Since Prep[|<S'| ^ cr] Pr^p[|S'| ^ cr] = 0p(cr), we can conclude that 

E[^(f)]^E[^(f)]±Op(aVd+-). 

vpr, Qp \ cr/ 

By choosing tr appropriately, we balance the two errors and obtain our invariance principle. 

For minor technical reasons, instead of using l|s|s 5 cr we actually use a Lipschitz function supported on 
IS”! s: cr. 

Main theorems Our main theorem is Theorem l5.81 proved in Section lOl on pageEOl This is an invariance 
principle for low-degree, low-influence functions and Lipschitz functionals, comparing the uniform measure 
on the slice Vp^ to the measure /ip on the Boolean cube and to the Gaussian measure Qp. 

Some corollaries appear in Section 15.21 on page [2TJ Corollary 15.91 gives a bound on the Levy distance 
between the distributions f{vpn) and f{Qp) for low-degree, low-influences functions. Corollary 15.111 gives 
a bound on the CDF distance between the distributions fii'pn) and f{Gp) for low-degree, low-influences 
functions. Corollary 15. 121 extends the invariance principle to functions of arbitrary degree to which a small 
amount of noise has been applied. 


5.1 Main argument 

We start by showing that from the point of view of L2 quantities, distributions similar to Pp behave similarly. 

Definition 5.1. Let p e (0,1). A parameter q e (0,1) is p-like if \p — q\ ^ -\/p(l — p)/n. A distribution is 
p-like if it is one of the following: pq, Vqn, Gq, where q is p-like. 

Lemma 5.2. Let f be a harmonic multilinear polynomial of degree d ^ s/n, and let be two p-like 

distributions. Then 

WfWlr = 11/11^2 (l±Op(d/Vn)). 

The same holds if we replace ||/p with Inf®^[/] = ||/ — or Inf ^[/] = ||^||^- 

Furthermore, there is a constant Sp such that if d ^ Sp-\/n then for all p-like distributions 


1 < ll/p, 

2 ^ II/IIP 


^ 2 . 


Proof. Let aoiq) = (2q(l — q))^, where D ^ d. An easy calculation shows that a.'jj{q) = 2(1 — 2q)D{2q{l — 
q))^~^, and in particular \a'jj{q)\ = 0{DaD{q)/q{^ — q)). It follows that for p-like q, aoiq) = q;d(p)(1 ± 
Op{D/y/n)). Lemma l3. 1 II thus shows that for tt e {pq, Vqn, Gq} and all D ^ d, 

WxdWI = aniq) (^I ± Op - P))^ (^1 ± Op . 

Since D ^ d and d ^ ^/ri implies d'^/n ^ df-s/n, we conclude that 


WxdWI = (2p(i -p))^ ^1 ± Op • 

The lemma now follows from Corollary 13.101 


□ 


We single out polynomials whose degree satisfies d ^ Sp^/n. 

Definition 5.3. A polynomial has low degree if its degree is at most Sp^Jn, where Sp is the constant in 
Lemma 15.21 
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We can bound the cube-influence of a harmonic multilinear polynomial in terms of its slice-influence. 


Lemma 5.4. Let f be a harmonic multilinear polynomial of low degree d, and let tt be a p-like distribution. 
For all is [n], with respect to tt; 

Inf“[/]«:Op(^^V[/]+Inf,*[/]). 

Proof. We will show that for the product measure tt = it holds that 

2(i 2n 


Inf^[/] < 


p(l -p){n-d) 


V[/] + 


p{l-p){n-d) 


Inf|[/] 


which will imply the statement of the lemma by Lemma 15.21 

The idea is to come up with an explicit expression for Inf®[/]. Let j ^ i. For S not containing j we 
have 

4'^^ = UJS, 4uW = 

Therefore 

Inf4/] = 1/ - = 2] if{S u {*}) - f{S u 

^,jts 

On the other hand, we have 

p(l - p) Inf,^[/] = 2 fiSf ^ 2 Ksfin - |5|) = E E ^ 


S3l 


S3l 


j^i i,jfS 


The L2 triangle inequality shows that f{S u {i})^ ^ 2/(5' u {j})^ -b 2(/(S' u {i}) — f{S u and so 


p{l-p) Inf^[/] ^ 




n — d 


< 


j¥=i i,j$S 
2p{l-p) 


j¥=i 




2n 


l<[f] 


< 




using the Poincare inequality. Rearranging, we obtain the statement of the lemma. □ 

Using Lemma Em we can show that the behavior of a low degree function isn’t too sensitive to the value 
of q in Vqn. 

Lemma 5.5. Let f be a harmonic multilinear polynomial of low degree d, and let I be an integer such that 
vi is p-like. For every C-Lipschitz functional ij}, 


I E E [V^(/(x))]| = Op 

X'^yg, 

Proof. Let q = £/n, which is p-like. For i e [n], let (XbF®) be the distribution obtained by choosing 
a random e and setting F* = X® u {i}. Note that /(X*) — /(F*) = (/ — f^^^){X^). Since 

Prjy^ [aji = 0] = 1 — q, we have 

E[(/(X*) - /(F0)2] (1 - q)-^ Inf^[/]., = Op V[/].^„ + Inf|[/].,„) , 

using Lemma 15.41 and Lemma 15.21 
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Consider now the distribution (X, Y) supported on x (j”\) obtained by taking X ~ Vf, and choosing 
Y ^ X uniformly among the n — i choices; note that Y ~ Since (X,Y) is a uniform mixture of the 

distributions we deduce 

E[(/(X) - /(r))2] ^ Op 

=SOp(^V[/].^„), 

using the Poincare inequality Inf®[/] ^ (iV[/] (see Section 15^ . The lemma now follows along the lines of 
Lemma 13.11 □ 

We now apply a variant of the invariance principle for Lipschitz functionals due to Isaksson and Mossel. 

Proposition 5.6 ( [181 Theorem 3.4]). Let Qi,..., Qk be n-variate multilinear polynomials of degree at most 
d sueh that with respect to fip, V[Pi] ^ 1 and Inf^[Pi] ^ r for all i e [fc] and j e \n\. For any C-Lipschitz 
functional 'L: —> K (i.e., a function satisfying |4'(a;) — 'I'(y)| ^ C'lja; — y\\ 2 ), 

I E[vl/(gi,..., Qfe)] - E['k(Qi,..., Qu)]\ = Ou{Cpyi^), 

Up Gp 

for some (explicit) constant Pp ^ 1. 

Lemma 5.7. Denote 

\~\n 

■\/p{l-p)n 

Let f be a harmonic multilinear polynomial of low degree d > 1 such that with respect to pp, E[/] = 0, 
V[/] ^ 1 and lnf)'[/] ^ r for all i e [n]. Suppose that r ^ R~'^ and n ^ R^, for some constant Rp. For 
any C-Lipschitz functional ip such that ipiO) = 0 and B-Lipschitz functional (p (where B ^ 1) satisfying 

||</^||oo ^ I7 

I Emms)] - Em)cp{s)]\ = m (vBpf (r + ^y ^. 

The condition Inf®[/] ^ r for all i e [n] can be replaced by the condition Inf“[/]^p ^ t for all i e [n]. 

Proof. For M to be chosen later, define 


tpix) 


—M if 'tp{x) ^ —M, 

< 1 p{x) if — M ^ ^ 

M if M ^ 


It is not hard to check that ip is also C-Lipschitz. 

We are going to apply Proposition 15.61 with Qi = f, Q 2 = S/^/p{l — p)n, and '^{yi,y 2 ) = ip{yi)(p{y 2 )- 
With respect to pp, ¥[( 52 ] = 1 and Inf(^[Q 2 ] = l/(p(l — p)n) for all i e [n]. Lemma [531 shows that 
Inf^/] = Op{^ + t), and so the cube-influences of Qi,Q 2 are bounded by Op(T + ^). Since 


I^'(yi,y2) - 4'(2 :i,z 2)| < |4'(yi,y2) - '^{yi,Z2)\ + \'S{yi,Z2) - '^>(zi,Z2)\ ^ 

MB\y2 - Z2I + C'lyi - zi|, 


we see that ¥ is {XIB + C)-Lipschitz. Therefore 


\E[ip{ms)] 

Mp 


E[(p{f)m]\ = Op 

Gp 


(^{MB + C)p^p 
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Next, we want to replace '0 with ip. For tt e {iip,Qp} we have 


I E[0(/)0(,S)] - E[iP{mS)]\ ^ E[|0(/)| |</)(5)|1|^(/)|>m] < 

TT TT TT I -r VJ / I 

Therefore 

I E[iPimS)] - E[0(/)0(5)]| = Op (^{MB + C)p'^p [r+^y + • 

Choosing M = C/-\JBp'^ (t + completes the proof. The conditions on r, n guarantee that 
1, and so B > 1 allows us to obtain the stated error bound. 

In order to finish the proof, we combine Lemma 15.71 with Lemma 15.51 

Theorem 5.8. Let f be a harmonic multilinear polynomial of degree d such that with respect to Vpn, V[/] ^ 1 
and Inf^[/] ^ r for all i e [n]. Suppose that r ^ Ip‘^S^ cind n ^ IpfS^, for some constants Ip, K. For any 
C-Lipschitz functional ip and for tt g {Qp, pp}, 

I E[0(/)]-E[0(/)]| = Op(O<5). 

l^pn 

The condition Inf^[/] ^ r for all i G [n] can be replaced by the condition Inf^[/]^p ^ r for all i G \n\. 

Proof. We prove the theorem for n = Qp. The version for pp then follows from the classical invariance 
principle, using Lemma 15.41 

Replacing / with / — E[/] (recall that the expectation of / is the same with respect to both pp and tt ) 
doesn’t change the variance and influences of /, so we can assume without loss of generality that E[/] = 0. 
Similarly, we can replace ip with ip — 0(0) without affecting the quantity Ej,^„ [0(/)] — E^^ [0(/)], and so we 
can assume without loss of generality that 0(0) = 0. 

For a parameter ct ^ 1 to be chosen later, define a function p supported on [—a, a] by 


p{x) 


1 + x/a if — cr ^ a: ^ 0, 
1 — x/a if 0 ^ X ^ cr. 


Note that ||(/)||oo 


1 and that p is (1/(T)-Lipschitz. Lemma [5.71 (together with Lemma [5.21) shows that 


\E[p{f)p{S)] 


E[0(/)0(5)]| = COp ( 


1 / 12 ^ 




assuming r ^ Rp and n > Rp (the condition on n implies that d is low degree). 

Let a be the distribution of xi + • • • + x„ under Qp. Lemma 13.21 and Lemma 13.91 show that 


mp{f)p{s)] 


E [ E mm )] 

7p,g ^yp{l-p)n -* 


E[0(/)] E [Pi 

Gp 


q-np \-| 
^p{l-p)n 


E[0(/)]E[0(5)]. 
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Similarly, Lemma 15.51 shows that 


Therefore 






f-ip ^np l^p 

y Prr6'= U( )\E\ib(f)]- 

\k—np\^a'y/p{l—p)n 

y Pr[g= ](/>( )\k-np\Op 

I fc—np I ^ (7-yp^l—p)n 

; E[(/.(S')]Op(C(TVd). 


E[y/)]| 

^pn 



\Em)]m{s)]- E[y/)E[</.(y]|ss 

t/p ^p i^Tip Mp 

+ n-1/24)) + OpiCaVd E [</)(y]). 

Mp 

Proposition 15.61 shows that 

I E[</>(y] - E[</>(y]| = Op(a-i/2„-i/6). 

»p Mp 

Moreover, Eg^[ip{f)] ^ C'Egp[|/|] = Op{C). It follows that 

E[0(y]|E[V-(/)]- E m)]\ ^ Op{Ca-^^^pyy^^^ + n-^^y) + Op{CaVdE[^iS)]). 

Mp »p i^np l^p 

It is not hard to check that Egp[0(5')] = 0p(cr), and so for n > Ap(j~'^ we have E^p[(/)(5')] = 0p(cr), implying 

I E[V^(/)] - E [V^(/)]| ^ OpiCa-^l^pfyl^’^ + n-^iy + CaVd). 

4/p i^np 

Choosing a = I d}A ^ obtain 

I E[V-(/)] - E m)]\ ^ + „-i/60)rf3/i0)^ 

4/p ^np 

It is not hard to check that if d ^ BpU^ and n > Mp then n > Apa~^, and that if T,n~^ ^ then 

(7^1; these are the conditions necessary for our estimate to hold. In fact, for an appropriate choice of 
7 p > 7 , the condition n ^ implies the condition d ^ Bpn^, and furthermore allows us to estimate 
j^-i/60j^3/io _ Op{n~^Ao^ (say), and to control the other error term similarly. This completes the proof of 
the theorem. □ 

5.2 Corollaries 

Theorem IS. 81 allows us to bound the Levy distance between the distribution of a low degree polynomial with 
respect to Vpn and the distribution of the same polynomial with respect to Qp or pp. This is the analog 
of [ini Theorem 3.19(28)]. 

Corollary 5.9. Let f be a harmonic multilinear polynomial of degree d such that with respect to Vpn, V[/] ^ 1 
and Inf®[/] ^ r for all i e \n\. There are parameters Xp,X such that for any 0 < e < 1/2, if t ^ X~‘^e^ 
and n > X^e^ then the Levy distance between fiiy'pn) and /(tt) is at most e, for tt e {Qp,pp}. In other 
words, for all a, 

Pr [/ ^ cr — e] — e ^ Pr[/ < ct] ^ Pr [/ ^ cr + e] + e. 

I'pn TT l^pn 
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Proof. Given a and e, define a function ip by 


fj{x) 


0 if cc ^ tJ, 

< if cr ^ a; ^ cr + e, 

1 if cc ^ cr + e. 


Note that tp is (l/e)-Lipschitz. Theorem 15.81 shows that if t ^ Ip '^5^ and n ^ 

Pr[/ ^a]~PT[f ^a + e]^ E[^(/)] - E [^(/)] = Op{6/e). 

TT 2/pn TT 2/pn 

We can similarly get a bound in the other direction. To complete the proof, choose S = Cpt^ for an appropriate 
cp. im 

Using the Carbery-Wright theorem, we can bound the actual CDF distance. This is the analog of 
Theorem 3.19(30)]. 

Proposition 5.10 (Carbery-Wright). Let f he a polynomial of degree at most d sueh thatY^f^g.^ = 1. Then 
for all e > 0 and all x, 

Pr[|/-a;| ^ e] = 0{de^l'^). 

Qp 

Corollary 5.11. Let f be a harmonic multilinear polynomial of degree d such that with respect to Upn, 
V[/] = 1 and Inf®[/] ^ r for all is [n]. There are parameters Yp,Y such that for any 0 < e < 1/2, 
if T ^ {Ypd)~‘^e^‘^ and n > {Ypd)‘^/e^‘^ then the CDF distance between /(t'pn) and /(tt) is at most e, for 
TT e {Gp,tip}- In other words, for all a. 


|Pr[/^a]-Pr[/^a]| ^e. 

l>pn tt 

Proof. It is enough to prove the corollary for n = Gp, the other case following from the corresponding result 
in the classical setting. Corollary 15.91 and the Carbery-Wright theorem show that for r ^ Xp’^p^ and 
n > Xp/rj^ we have 

Pr[/ ^ ct] ss Pr[/ ^ cr + ? 7 ] + 7? ^ Pr[/ ^ cr] + Op{dr]^^‘^). 

We can similarly obtain a bound from the other direction. To complete the proof, choose ry = Cp{e/d)‘^ for 
an appropriate Cp. □ 

All bounds we have considered so far apply only to low degree functions. We can get around this 
restriction by applying a small amount of noise to the functions before applying the invariance principle 
itself. This is the analog of [26l Theorem 3.20]. 

Even though the natural noise operator to apply on the slice is Hp, from the point of view of applications 
it is more natural to use Up (which we apply syntactically). Lemma 13.161 shows that the difference between 
the two noise operators is small. 

Corollary 5.12. Let f be a harmonic multilinear polynomial such that with respect to Vpn, V[/] ^ 1 and 
Inf®[/] ^ T for all i e [n]. There is a parameter Zp such that for any 0 < e < 1/2 and 0 < (5 < 1/2, if 
T ^ and n > l/e^p/'^ then for tt e {Gp, Pp), 

1 E [^(t/i_5/)] - E[^([/i_,/)]l = Op{Ce). 

^pn tt 

Proof. Let g = Uisf. Let d be a low degree to be decided, and split g = g^‘^ + g^‘^. With respect to Vpn, 
= 2]t>£;(l — ^ (1 ~ On the other hand. Theorem 15.81 shows that if r ^ and 

n > Ip/^^ then 

1 E[V^(5=S'^)]-E[^(5=s-^)]l = Op(C'e). 

Vpn. tt 
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Since II5 — g^'^W = ||g^'^||, Lemma [XT] and Lemma [Q] show that as long as the degree d is low, 

I E [^{g)] - E[^{g)]\ = OpiCe + C(1 - J)'') = Op{Ce + Ce-^^). 

l^pn TT 

Choosing d = log(l/e)/5, the resulting error is Op(C'e). This degree is low if \og{l/e)/5 ^ Spy/n, a condition 
which is implied by the stated condition on n. □ 


6 Majority is stablest 

Recall Borell’s theorem. 

Theorem 6.1 (Borell [5]). Let f: K" ^ [Oj 1] have expectation p with respect to N(0,1)". Then §p[/] ^ 
Tp{p), where Tp{p,) is the probability that two p-correlated Gaussians be at most 

Borell’s theorem remains true if we replace the standard Gaussian with Qp. Indeed, given a function /, 
define a new function g by g{x) = /(•\/p(l — p)x + p). If x ~ N(0,1) then y'p(I — p)x + p ~ Qp, and so 
lE[/]Sp = E[p]n(o,i)- Similarly, Sp[/]ep = §p[5]n(o,i)- Indeed, \i y = px + 1) then 

+ p = p + pVp{^-p)x + Vr^^N(o,p(i -p)) 

= (1 - p)p + p{\/p{l - p)x + p) + s/l- p2 N(0,p(I -p)). 

Therefore Borell’s theorem for / and Qp follows from the theorem for g and N(0,1). 

Majority is stablest states that a similar bound essentially holds for all low influence functions on the 
slice. This result was originally proved using the invariance principle in |26| . An alternative inductive proof 
appears in [7]. 

It is known (see for example [26]) that the bound (p) is achieved by threshold functions. Corollarv l3.I2l 
together with Lemma [3. 161 shows that threshold functions achieve the bound also on the slice. Indeed, take 
a threshold function f on d variables such that with respect to Pp, E[/] = p and §))[/] 5= rp(p) — e. Let / be 
the restriction of / to the slice (^^). Corollary 13.121 shows that E[/] = p ± o„(l) and Sp[/] = §p[/] ± o„(l). 
Lemma 13. 161 shows that Sp[/] = Sp[/] ± o„(l). Therefore for large n, E[/] kk p and §p[/] > rp(p) — 2e. 

Our proof of majority is stablest closely follows the proof of [5S1 Theorem 4.4] presented in [^ §11-7]. 
We need an auxiliary result on Tp. 

Proposition 6.2 f [26l Lemma B.6]). For each p, the function Tp defined in Theorem \6.1\ is 2-Lipschitz. 

Theorem 6.3. Let f: (^')|) ^ [0,1] have expectation p and satisfy Inf^[/] ^ r for all i e [n]. For any 
0 < p < 1, we have 

%[f] ^ ^pid) + Op,p ^ ^ + Op , where a = min(T, L). 

The condition Inf^[/] ^ r for all i e [n] can be replaced by the condition lni'j[f]p^ ^ r for all z e [n]. 

Proof. We identify / with the unique harmonic multilinear polynomial agreeing with it on . For a 
parameter 0 < (5 < 1/2 to be chosen later, let g = Hisf- Note that the range of g on is included in 
[0,1] as well, since LIis is an averaging operator. We have 

pn 

S/[/] - S/M =S/[/]-§/(!_,).[/] = ^/(i-G-i)/-)(i_(i_5)2Ai-G-i)/n))||;=d||2, 

d=0 

Since d ^ n/2, we have 

^d{l-(d-l)/n)^^ g^2d{l-(d-l)/n)^ ^ pd/2^^ _ _ g^2d^ ^ 2Sdp^/^. 
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(1) 


The expansion x/{l — xY = Yjd shows that ^ \fpl^ — -sfpYi and so 

Vp 


l§p[/]-S^[5]|<2d 




From now on we concentrate on estimating §p[g]. 
Define the clumped square function Sq by 


Sq(a;) = < 


0 if a; ^ 0 , 

x"^ if 0 ^ re ^ 1 , 
1 if re ^ 1 . 


It is not difficult to check that Sq is 2-Lipschitz. Corollary 15.121 together with Lemma 13.161 shows that for 
all e > 0, if T, i ^ £^p/' 5 tiien 


I SpM - SpMI = I E [Sq(Lf^5)] - E[Sq(C/^5)]| = Op(e) + O 

^pn t/p 


(1 - V~p)- 


( 2 ) 


We would like to apply Borell’s theorem in order to bound Sp[ 5 ], but g is not necessarily bounded by 
[0,1] on R". In order to handle this, we define the function g = max(0, min(l, 5 )), which is bounded by [0,1]. 
Let dist|-Q ]^] be the function which measures the distance of a point x to the interval [0,1]. The function 
dist[op] is clearly 1-Lipschitz, and so Corollary 15.121 implies that under the stated assumptions on t, i, we 
have 

E[|5-5|] = E[dist[o,i](ff)] = I E [dist[o,i](5)] - E[dist[o.i](g)]| = Op(e). 

Vp »p ^pn »p 

Since is an averaging operator and Sq is 2-Lipschitz, we conclude that 

I §pM - %m\ = I E[Sq(C/vp5)] - E[Sq(C/vp5)]| = Op{e). (3) 

s/p s/p 


Lemma [31^ shows that Egp[ 5 ] = = p, and so | Egp[g] — p| = Op{e). Proposition l6.2l imDlies 

that rp(E[g]) ^ rp(/r) -I- Op(e). Applying Borell’s theorem ([Theorem 16.1|1 . we deduce that 

§=[5]s:rp(E[5])^rp(p) + Op(e). (4) 

Putting (II|),(l2]),([3|),(j4]) together, we conclude that 


Sp[/] rp(p) -f Op(e) -f o 


(1 - Vp)-^ 


+ 26 


Vp 

(1-Vp)"' 


Taking 6 = e, we obtain 

§;[/] < rp(p) + Op.p(e) + Op . 

The bounds on r, — now become r, — ^ from which we can extract the theorem. □ 

’ n ’ n ’ 


7 Bourgain’s theorem 


Bourgain’s theorem in Gaussian space gives a lower bound on the tails of Boolean functions (in this section. 
Boolean means that the range of the function is { + 1}). We quote its version from Theorem 2.11]. 

Theorem 7.1 (Bourgain). Let /: K" ^ {±1}- Aor any k ^ 1 we have, with respect to Gaussian measure 
N(0,1), 


||/>'=P = D 
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While the theorem is stated for N(0,1), it holds for Qp as well. Indeed, given a function /, define a new 
function g by g{x) = - p)x+p). If x ~ N(0,1) then ^/p(l -p)x+p ~ Gp, and so V[/]ep = V[g]N(o.i)- 

Our definition of for Qp makes it clear that = /^*(-\/p(l — p)x + p), where 5 ’"* is the degree i 

homogeneous part of g. This implies that ||/^^||g^ = ||5 ^^|In(o i)- Therefore Bourgain’s theorem for / and 
Gp follows from the theorem for g and N( 0 , 1 ). 

Following closely the proof of EH Theorem 3.1], we can prove a similar result for the slice. 

Theorem 7.2. Fix fc > 2. Let f: (^’h) ^ {+1} satisfy Inf®[/*®^] ^ r for all i e \n\. For some constants 
Wp,k,C, ifT ^ VF-i V[/]^ and n > Wp,k/^[fT then 





The condition Inf^[/] ^ r for all i e [n] can be replaced by the condition Inf^[/]^p ^ r for all is \n\. 
Proof. We treat / as a harmonic multilinear polynomial. Since / is Boolean, working over i/p„ we have 

WrT = 11 / - > 11/^'= - Sgn(/=S'=)P = ||/=S'=||2 + 1 _ 2E[|/=S'=|]. 


Lemma lOl shows that Wf^’^Wg^ = ||/'®^P(1 + Op{k"^/n)). Since the absolute value function is 1-Lipschitz, 
Theorem lS.SI applied to the parameter 5 > 0 shows that if r ^ Ip’^S^ and n ^ Ip/S^ then then | [|/**^|] — 

= Op{6). This shows that 


f > fe ||2 


>llf 


sgfe 


sgn(/=S'=: 




Let g = sgn(/**^). With respect to Gaussian measure Gp, ||/*** — using 

Bourgain’s theorem (Theorem [73J ■ Putting everything together, we conclude that 


11 / 


>k\\2 




/V[g]c 


V Vk 


— Op I 5 + 


(5) 


It remains to lower bound V[g]c;p. Note first that over Vpn, V[/] = 4Pr[/ = 1] Pr[/ = —1], and so 
Pr[/ = l],Pr[/ = —1] > V[/]/4. We can furthermore assume that 

Pr[/=s'= |], Pr[/=s'= < -|] > V [/]/ 8 , 


since if for example Pr[/**^ 5= |] ^ V [/]/8 then with probability at least V [/]/8 we have / = 1 and f^’^ < |, 
and so = 11/ - f^'^f > I • V [/]/8 = fI(V[/]). Corollary ED applied with e = V[/]/16 ^ 1/3 shows 

that for an appropriate c, if r ^ tK~^fc and n > cXp then 

Pr[/=s'=>i],Pr[/^'=<-i]5= V[/]/16, 

yp yp ^pn 

and so Vgp[g] > 4 [/]/16(l - [/]/16) = n(Vp,^„[/]). Combining this with ([5]) shows that under Vpn, 


11/ 


>/ c ||2 


> Gl 




Op{5+^'). 


( 6 ) 


Choosing 5 = CpV[/]/'\/fc for an appropriate Cp completes the proof. □ 

We do not attempt to match here EH Theorem 3.2], which has the best constant in front of V[/]/Vfc. 
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8 Kindler—Safra theorem 


Theorem 17.21 implies a version of the Kindler-Safra theorem [22 [20] , Theorem 18.51 below. 

We start by proving a structure theorem for almost degree k functions. We start with a hypercontractive 
estimate due to Lee and Yau [13] (see for example [131 Proposition 6.2]). 

Proposition 8.1. For every p there exists a constant rp such that for all functions f: (^"^) —> R, ||^frp /||2 ^ 
II/II 4 / 3 - 

This implies the following dichotomy result. 

Lemma 8.2. Fix parameters p and k, and let f: —> {±1} satisfy ||/^^P = e. For any i,j e [n], either 

InfL[/] ^ e/2 or Inf/^ [/] Jp,k, for some constant Jp^k- 

Proof. Let r = rp be the parameter in Proposition 18.II Let g = {f — /^*-’^)/2, so that Inf^j[/] = Ijl^p. Since 
g{x) e {0, ±1}, ||g|| 4 y 3 = || 5 p = 4InfL[/]. Proposition 18.II therefore implies that 

(4Inf^[/])3/2 = || 5 || 2^3 ^ iHrgWl > WHrg^^^Wl > 

Since g = {f — f^^^'^)/2, we can bound ^ ll/^*P = £• Therefore 

(4Inf^[/])3/2 ^ r-'=(||5p - e) = r'=(4Inf^[/] - e). 

If 4Inf®j [/] > 2e then 4Inf/^ [/] — e > 4Inf®^ [/]/2 and so 4InfL[/] > r^’^/A. □ 

We need the following result, due to Wimmer [331 Proposition 5.3]. 

Lemma 8.3 ([331 Proposition 5.3], [131 Lemma 5.2]). Let /: (^/j) ^ R. For every r > 0 there is a set 
J c [n] of size 0(Inf®[/]/r) such that lniL[f] < r whenever i,j f J. 

Combining Lemma [8.21 and Lemma [8.31 we deduce that bounded degree functions depend on a constant 
number of coordinates, the analog of Theorem 1]. 

Corollary 8.4. Fix parameters p and k. If f' > { + 1} has degree k then f depends on Op^fe(l) 

coordinates (that is, f is invariant under permutations of all other coordinates). 

Proof. Apply Lemma 18.31 with r = Jp ^ to obtain a set J of size 0(k/Jp^k)- Lemma 18.21 with e = 0 shows 
that for i,j f J we have InfC[/] = 0, and so / is invariant under permutations of coordinates outside of 

J. □ 

Using Bourgain’s tail bound, we can deduce a stability version of Corollarv l8.4l namely a Kindler-Safra 
theorem for the slice. 

Theorem 8.5. Fix the parameter k ^ 2. Let f: (^^) ^ {+1} satisfy ||/^^P = £■ There exists a function 
h: (f/j) —> {±1} of degree k depending on Ofc_p(l) coordinates (that is, invariant under permutations of all 
other coordinates) such that 

\\f-hf = Op,k + 

for some constant C. 

Proof. Let F = f^^. We can assume that 2e < Jp^k(2, since otherwise the theorem is trivial. Apply 
Lemma lA3l to F with parameter r = Jp^k — 2e > Jp,kf2, obtaining a set J of size 0{k/T) = Op^fc(l). It is 
not hard to check that 

Inf^,.[A] < Inf^,.[/] < Inf^[F] + 21|/>'=1|2 = Inf^[F] + 2e. 
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Therefore if i,j^J then Inf®j[/] < r + 2e = Jk^p, and so Lemma 15?^ shows that Inf^j [F] ^ Inf®j [/] = 0{e). 

For cc e {0,1}'^, let Gx and gx result from F and / (respectively) by restricting the coordinates in J to the 
value X. It is not hard to check that Prs~j/p„[<51 j = x] ^ {p — Op(| J|/n))l'^l = np^fe(l), as long as n > Np^k 
for some constant Np^k] if n ^ Np^k then the theorem is trivial. We conclude that Inf5[G'a;] = Op,fc(e) for 
all and \Gx — 5a;P = = Op.fc(e)- Together these imply that Inf[p(/a;] = Op^k{f) for all i,j ^ J, 

and so Infp^^:] = Op^k{^) for all J. 

We can assume that n — | J| > n/2 (otherwise the theorem is trivial) and that the skew px of the 
slice on which Gx.Px are defined satisfies Px = p + Op(|J|/n) = 0(p), and so Theorem 17.21 implies that 
either maxilnf®[gj > W~lY[gx]'^, or n < 2Wp^k/Y[gx]^, or Y[gx] = 0{Vk\\g>'^\\‘^) = Op,k(e)- Since 
maxi Inf®[gia;] = Op_fe(e), we conclude that 

Vfe]. O,,, p/o + . 

Define a function g by g{S) = ]E[5s|^]. The bound on Y[gx] implies 

\\f-gf = Op,k + 

If we let h = sgng then we obtain the desired bound ||/ — hp ^ 4||/ — (;p. 

It remains to show that h has degree fc if e is small enough and n is large enough. We can assume 
without loss of generality that J = [M], where M is the bound on | J|. We have ||/ — hf^ ^ ||/^^ ~ ^ 

(||h^^|| — -pe)^. Therefore 

||/i>p| ^ Ve + Op.fc . 

On the other hand, we can write h as & Boolean function H of a;i,..., xm- Lemma l4.II shows that deg h ^ 
degiL, and so degh > k implies that degiL > k. Corollary 14.41 51 implies that for large enough n, = 

//(!)■ Since there are only finitely many Boolean functions on xi,... ,xm which can play the role of FI, 
we conclude that if e is small enough and n is large enough then deg h ^ fc. □ 

We conjecture that Theorem IS. 51 holds with an error bound of Op_fe(e) rather than Op,fe(e^/‘" + l/n^/'^). 


9 ^-Intersecting families 

As an application of Theorem 18.51 we prove a stability result for the t-intersecting Erdos-Ko-Rado theorem, 
along the lines of Friedgut m- We start by stating the t-intersecting Erdos-Ko-Rado theorem, which was 
first proved by Wilson [5^ . 

Theorem 9.1 ([32]). Let t ^ 1, k ^ t, and n ^ {t + I)(fc — t + 1). Suppose that the family F c ([)(p is 
t-intersecting: every two sets in T have at least t points in common. Then: 

(“) 1^1 < G-.)- 

(b) If n> [t + l)(fc — t + 1) and |J^| = (^Zj) then T is a t-star: a family of the form 

F={Aeyy^ -.S^A}, |5| = t. 

(c) Ift^2,n={t+ l)(fc — t + 1) and |J^| = {^Zl) then F is either a t-star or a {t, 1)-Frankl family: 


^ : |An 5| > t + 1}, |5| = t + 2. 
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The case t = 1 is the original Erdos-Ko-Rado theorem m- Ahlswede and Khachatrian mm found the 
optimal t-intersecting families for all values of n, k, t. 

A stability version of Theorem 19.11 would state that if |J^| (^Zi) then T is close to a t-star. Frankl [H] 

proved an optimal such result for the case t = 1. Friedgut m proved a stability result for all t assuming 
that k/n is bounded away from l/(t + 1). 

Theorem 9.2 ([12]). Let t ^ 1, k ^ t, X,( > 0, and Xn < k < — ()n. Suppose T c ([”]) is a 

t-intersecting family of measure |J^| = (^Zt) ~ there exists a family Q which is a t-star such that 


\T/xg\ 


- Ot,A,c(e)- 


Careful inspection of Friedgut’s proof shows that it is meaningful even for sub-constant Q, but only as 
long as C = ^^(1/vZi). We prove a stability version of Theorem 19.11 which works all the way up to C = 0. 

Theorem 9.3. Let t^2, k^t+1 and n = {t + l){k — t + 1) + r, where r > 0. Suppose that k/n ^ A for 
some A > 0. Suppose T c (M) is a t-intersecting family of measure |2^| = {/.-t) ~ there exists 

a family Q which is a t-star or a (t, 1)-Frankl family such that 


l-^Ag| 

il) 


= 0; 


t,x 


^max 



for some constant C. 

Furthermore, there is a constant At^\ such that e ^ At^\ min(r/fc, implies that Q is a t-star. 

We do not know whether the error bound we obtain is optimal. We conjecture that Theorem 19.31 holds 
with an error bound of Ot.\(jnax{k/r, l)e). 

Friedgut’s approach proceeds through the Hp version of Theorem 19.11 first proved by Dinur and Safra [8] 
as a simple consequence of the work of Ahlswede and Khachatrian. The special case p = 1/d (where d > 3) 
also follows from earlier work of Ahlswede and Khachtrian [2], who found the optimal t-agreeing families in 

Theorem 9.4 l[8].|17j.[T^b Let t^l and p ^ l/{t -\-l). Suppose that T c {0,1}” is t-intersecting. Then: 
(a) iXp{T) ^ ig/. 

(h) If p < l/{t -\- 1) and g.p{T) = p* then IF is a t-star JiTf . 

(c) If t ^ 2, p = l/(t + 1) and fJ-p{F) = p* then F is either a t-star or a {t, 1)-Frankl family 

Friedgut HZ] deduces his stability version of Theorem 19.11 from a stability version of Theorem 19.41 While 
Friedgut’s stability version of Theorem 19.41 is meaningful for all p < l/{t 4- 1), his stability version of The¬ 
orem [9T] is meaningful only for k/n < l/(t 1) — uj{l/-\/ri). A more recent stability result for compressed 
cross-t-intersecting families due to Frankl, Lee, Siggers and Tokushige [15], using completely different tech¬ 
niques, also requires k/n to be bounded away from l/{t+ 1). 

Friedgut’s argument combines a spectral approach essentially due to Lovasz [24] with the Kindler-Safra 
theorem [ZUHO]- Using Theorem 18.51 instead of the Kindler-Safra theorem, we are able to obtain a stability 
result for the entire range of parameters of Theorem 19.II We restrict ourselves to the case t > 2. 

Our starting point is a calculation due to Wilson [32] . 

Theorem 9.5 ([32]). Let t ^ 2, k ^ t-\-l, and {t+ l){k — t+ 1). There exists an (f^^) x ([((I) symmetric 
matrix A such that Ass = 1 for all S e Ast = 0 for all S T e (f”^) satisfying \S r\T\^ t, and for 
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all functions f: —> K., 


^/ = 2 


e=0 


Ae=l + (-l)*-l-^2(-ir 


i=0 


k — 1 — i\ fk — e\ fn — k — e + i\ fn 


k — t 


The eigenvalues Ae satisfy the following properties: 

(-ij Ai . ■ ■ ■ . A, . 0. 

(c) Xt +2 > 0, with equality if and only if n = (t + l){k — t + 1). 

(d) At+i > At +2 and Ae > At +2 for e > t + 2. 


k — e 


k — t + i\ 

k — t J 


Wilson’s result actually needs n ^ 2k, but this is implied by our stronger assumption fc > t + 1 (Wilson 
only assumes that k ^ t) since (t + l)(fc — t + 1) — 2fc = (t — l)(fc — (t + 1)) 5= 0. 

We need to know exact asymptotics of At+ 2 . 


Lemma 9.6. Let t ^ 2, k ^ t+1 and n = (t + 1 + p){k — t + 1), where p> 0. Let A = At +2 be the quantity 
defined in Theorem 1 9 . 51 Then 

A = nt(min(p, 1)), lim A = 1. 

p—>(X) 

Proof. Wilson [32l (4.5)] gives the following alternative formula for A: 


A= 1 

Algebraic manipulation shows that 


t + 1 


E 

2 = 0 


z + 2 


i 


V 2+2 ) 


A - 1 - ^(i + 1) 


i=0 




i + 2j (n-k-t+i\ ■ 
\ 2 + 2 / 


Calculation shows that n — k — t = {t p){k — t 1) — 2t 1. Therefore 


t-i 


A - 1 - ^(z + 1) 


2 = 0 


t + 1 
z + 2 


/fc-n 

U+2/ 


^(i+p) (fc—1) — +1+A^ 


This formula makes it clear that limp_>oo A = 1, and that A is an increasing function of p. 
Assume now that p ^ 1. Then 


A = l-^(* + l) 


2 = 0 


t + 1 
z + 2 


{t + pY 


i±0,|i 
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Let us focus on the main term. Setting a = l/(t + p), we have 




i=0 


t+ 1 
i + 2 


i=0 




- (^+ 1) Zj 


2 = 0 


z + 2 


z + 1 


2 = 0 
t-1 


t+ 1 
z + 2 


2-|-2 


a 


2-|-2 


-L 


2 = 0 


i + 1 
z + 2 


a 


, 2-|-2 




t+i 


t+ 1 


z = l i=2 

\t ffl , „At+l 


= {t+ l)a((l + ay - 1) - ((1 + ay+^ - 1 - {t + l)a) 
= 1 — (1 + «)*(! — to). 


Substituting a = l/(t + p), we obtain 

t-i 


E(* + i) 


2 = 0 


t + 1 
z + 2 


/, ^-^-2 _ 1 _ ft + p + 1)* P ^ , _ pft + 1 + pY 
^ {t + pY t + p ft + p)‘+i ■ 


Therefore when p ^ 1, 


p(t + 1 + p) 

ft + p)‘+i - 


t 


In particular, we can find some constant Ct such that 



it + i + pY 

{t + p)*+i 



Therefore for 2Ctlk ^ p ^ 1, we have A = flt{p)- Since A is an increasing function of p, this shows that for 
p > 2Ctlk, we have A = r2t(min(p, 1)). 

In order to finish the proof, we handle the case p ^ Ct/k. Consider n = {t + I)(fc — t + 1) + I. The value 
of I — A in this case is 



/fc-n 

Vi+2/ 


^i(fe-t+l)-2t+2+i^ 


ft + lA ("+2) 

+ 2y \ i J ^t(fe-t+i^-2t+i+»^ 


z + 2 


t(^k — t + 1) — 2t + 2 + 2 


The value of the last expression without the correction term I — t(fc_^_)_^^^ 2 t+ 2 +z exactly 1 by Theorem l9.5l 
and so 


Since A is increasing in p, this shows that for all p > 0 we have A = Ilftl/fc). If also p ^ Ct/k then this 
implies that A = A2t(p), finishing the proof. □ 

We need a similar result comparing the measures of t-stars and (t, 1)-Frankl families. 

Lemma 9.7. Let t ^ 2, k ^ t + 1 and n = {t + 1 + p){k — t) + t + 1, where p > 0. Let m be the measure of 
a t-star, and let mi be the measure of a (t, 1)-Frankl family. Then 


A> 


t{k-t+l)-2t + 2 


= nt T- 


m — mi 
m 


IIt(min(p, 1)), 


lim 

p—►OO 


m — TTii 

m 


= 1 . 
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Proof. We have 


Computation shows that 


n — t 
k — t 


, mi = (t + 2) 


n — t — 2 
k — t — 1 


n — t — 2 
k — t — 2 


m — mi ^ (t + 2){n — k)(k — f) + {k — t){k — t — 1) 
mi (n —t)(n —t—1) 

If n = (t + l)(fc — t + 1) + r then calculation shows that 


mi 


mi r(r + t{k — t) + 1) 
(n — t)(n — t — 1) 


> 


t — 1 r(r + t{k — t)) 
— t (n — t — 1)2 


Substituting r = {k — t)p, we obtain 

m — mi n — t — 1 {k — t)^p{p + t) n — t—1 p{t + p) 
mi '' n — t {k — tY{t+l + pY n — t {t + 1 + pY 

This shows that limp^oo(w —wi)/mi = 1. Since n ^ (t+l)(fc —1+1) > t + 2 implies {n — t—l)/{n — t) > 1/2, 
we also get 

m — mi pit + p) 
mi ^ 2(t + 1 + pY 

As p —> 00 , the lower bound tends to 1/2, and in particular, we can find Ct such that for p ^ ct we have 
(m — mi)/mi > 1/3. When p ^ ct, we clearly have (m — mi)/mi = i^tip), completing the proof. □ 

The method of Lovasz [24] as refined by Friedgut m allows us to deduce an upper bound on ||/^‘P for 
the characteristic function of a t-intersecting family. 

Lemma 9.8. Let t ^ 2, k ^ t+1 and n = it+l){k — t+l)+ r, where r > 0. he at-interseeting 

family, and f its characteristic function. Then 

||/>‘||2 = o ^max ■ (m-E[f]), where m = 

Proof. Let A be the matrix from Theorem 19.51 Since ||/^°|| = 1E[/], 

E[/] = </, A/> > XoE[fY + Xt+2\\rY. 

This already implies that E[/] ^ Ag ^ = m. Since Aq = m~^ and E[/] ^ m, we conclude that 

^ E[/]-m-iE[/]2 ^ E[/](l-m-iE[/]) ^ m - E[/] ^ 

Xt+2 Xt+2 Xt+2 

Lemma 1121 completes the proof. □ 

In order to prove our stability result, we need a result on cross-intersecting families. 

Theorem 9.9 ( [lb]). Let T c (["]) and Q c ^M) he cross-intersecting families: every set in T intersects 
every set in Q. If n ^ a + b and b ^ a then 

We can now prove our stability result. 
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Proof of Theorem 1 9. 31 In what follows, all big O notations depend on t and A. We can assume that n 
is large enough (as a function of t and A), since otherwise the theorem is trivial. We use the parameter 
p = {k — t + l)/n which satisfies A/2 < p < l/{t + 1). 

Let / be the characteristic function of so that ]E[/] = m — e, where m = {^ZD/Ck) ■ Lemma 1^751 shows 
that ||/^*P = 0(max(A:/r, l))e, and so Theorem 18.51 shows that ||/ —gp ^ S for the characteristic function g 
of some family Q depending on J = Jj coordinates, for some constant Jt, where 5 = 0(max((fc/r)^/‘^, 1)6^/*^ + 
here we use the fact that A ^ k/n ^ 1/2. We want to show that if <5 is small enough (as a function 
of t) then Q must be a t-star or a {t, 1)-Frankl family; if <5 is large then the theorem becomes trivial. 

We start by showing that if <5 is small enough then Q must be t-intersecting. Suppose without loss of 
generality that Q depends only on the first J coordinates. We will show that J = Q\[j] ^ {0,1}“^ must be 
t-intersecting. If J is not t-intersecting, then pick A,B e J which are not t-intersecting, with |A| > \B\. Let 
-4 = {5 e : A u 5 e and S = {S' e :B~oSeF}. Since n > (t + l)fc - - 1) and k > An, 

if n is large enough then (fc — |A|) + (fc — |i3|) ^ n — 2J, and so Theorem 19.91 shows that |^| + \B\ < 
Therefore 

1/ - »l= = kjAffl » , 0 {:^)) - «(i). 

using Lemma 1331 (for large enough n) and the fact that p > A/2. We conclude that if <5 is small enough, 
J = Q\[j] must be t-intersecting. 

Next, we show that if 5 is small enough then Q must be either a t-star or a (t, 1)-Frankl family. If Q 
is neither then Pp{J) < p* for all 0 < p ^ l/{t + 1) by Theorem 19.41 and in particular, since p > A/2, 
k-piJ) < p* — 7 for some 7 > 0; here we use the fact that there are finitely many t-intersecting families on J 
points. Since Vk{J) = Mp(i7)(l ± 0{l/n)) due to Lemma [3l4l for large enough n and small enough e we have 

11/ - pp > (E[/] - E[5])2 > (7(1 ± 0(l/n)) - e)2 = 11(1). 

We deduce that if n is large enough and e is small enough then Q is either a t-star or a (t, 1)-Frankl family. 

It remains to show that if e ^ At^\ min(r/fc, 1)*^+^ then Q cannot be a (t, 1)-Frankl family. Define 
T = min(r/fc, 1). Let rrii be the measure of a (t, 1)-Frankl family. Lemma 19.71 shows that m — mi = D(t) 
(since p > A/2 implies m = D(l)). Therefore if is a {t, 1)-Frankl family then E[p] ^ m — ^(t). On the 
other hand, E[p] > E[/] — 6 = m — e — 0((e/T)^/‘" + 1/n^^^). Put together, we obtain 

D(r) ^ e + 0((e/r)fo^ + l/n^^). 


Choose a constant c so that e ^ cr implies 

D(t) ^ 0((e/T)i/^ + 1/nfo'^); 

if e > cr then the theorem becomes trivial. The inequality implies that = 0[e/T) and so = 0(e), 

contradicting our assumption on e for an appropriate choice of At^\. □ 

Our conjecture on the optimal error bound in Theorem 18.51 implies an error bound of Ot,A(max(fc/r, l)e) 
in Theorem 19.31 


10 Open problems 

Our work gives rise to several open questions. 

1. Prove (or refute) an invariance principle comparing Vpn and jp^p for arbitrary (non-harmonic) multi¬ 
linear polynomials. 

2. Prove a tight version of the Kindler-Safra theorem on the slice fTheorem 18.5|) . 
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3. The uniform distribution on the slice is an example of a negatively associated vector of random variables. 
Generalize the invariance principle to this setting. 

4. The slice can be thought of as a 2-coloring of [n] with a given histogram. Generalize the invariance 
principle to c-colorings with given histogram. 

5. The slice has a g-analog: all fc-dimensional subspaces of for some prime power q. The analog 
of the Boolean cube consists of all subspaces of weighted according to their dimension. Generalize 
the invariance principle to the g-analog, and determine the analog of Gaussian space. 
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